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Abstract — A wireless relay with multiple antennas is called 
a multiple-input-multiple-output (MIMO) switch if it maps its 
input links to its output links using "precode-and-forward." 
Namely, the MIMO switch precodes the received signal vector in 
the uplink using some matrix for transmission in the downlink. 
This paper studies the scenario of K stations and a MIMO 
switch, which has full channel state information. The precoder at 
the MIMO switch is either a zero-forcing matrix or a network- 
coded matrix. With the zero-forcing precoder, each destination 
station receives only its desired signal with enhanced noise but no 
interference. With the network-coded precoder, each station re- 
ceives not only its desired signal and noise, but possibly also self- 
interference, which can be perfectly canceled. Precoder design for 
optimizing the received signal-to-noise ratios at the destinations 
is investigated. For zero-forcing relaying, the problem is solved 
in closed form in the two-user case, whereas in the case of more 
users, efficient algorithms are proposed and shown to be close to 
what can be achieved by extensive random search. For network- 
coded relaying, we present efficient iterative algorithms that can 
boost the throughput further. 

Index Terms — Beamforming, MIMO switching, network cod- 
ing, relay, zero-forcing. 



I. Introduction 

Relays in wireless networks can extend coverage as well 
as improve energy efficiency (T). In this paper, we study a 
setup in which multiple single-antenna stations communicate 
with each other via a multi-antenna relay. In each uplink slot, 
the stations simultaneously transmit, then in the subsequent 
downlink slot, the relay precodes its received signal by a cer- 
tain matrix before broadcasting to the stations. In the absence 
of noise, the multiple-input multiple-output (MIMO) system 
between the transmitters and the receivers can be viewed as a 
product of the downlink channel matrix, the precoder and the 
uplink channel matrix. In this work, we design a zero-forcing 
precoder so that the product channel is a desired permutation 
matrix, which forms a one-to-one mapping (or links) from 
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the transmitters to the receivers. Hence the technique is called 
MIMO switching. We also study a generalization referred to as 
network-coded precoding where the off-diagonal elements of 
the channel matrix form a permutation, and where the diagonal 
elements can be nonzero. Such nonzero diagonal elements 
cause self-interference, which can be fully canceled assuming 
the interference gains are available at the receivers. We study 
how to design the precoder to maximize the signal-to-noise 
ratios (SNRs) of the links. 

To the best of our knowledge, this work is the first to 
treat unicast non-pairwise switching patterns. Prior work that 
investigate data exchange via a relay includes Q~)-||6)- Refer- 
ences 0, 0, Q investigate the case of "full data exchange," 
in which all stations want to broadcast their data to all the 
other stations. A slotted system with a single-antenna relay 
is considered in and the maximum throughput region is 
evaluated. Data transmissions in 0, can be summarized 
as follows: In the first slot, all stations transmit to the relay 
simultaneously; subsequent slots are devoted to downlink 
transmissions; in each downlink slot, the relay multiplies the 
signal received in the first time slot by a different precoder, 
such that at the end of all downlink slots, all stations receive 
the broadcast data from all the other stations. By contrast, the 
framework investigated in this paper focuses on the unicast 
case, in which station i transmits to another station j only. 
(Station j may transmit to a different station than i.) Any 
general transmission pattern (unicast, multicast, broadcast, or 
a mixture of them) among the stations can be realized by 
scheduling a set of different unicast transmissions, as has been 
pointed out by the authors in preliminary work 0. A single- 
antenna relay with different forwarding strategies is considered 
in HI, which studies both full data exchange and "pairwise 
data exchange," in which stations form pairs to exchange data 
with each other only. It is a special case of unicast switching 
studied here. Reference [6] studies pairwise data exchange 
only, where the relay adopts the decode-and-forward strategy. 
The diversity-multiplexing tradeoff under reciprocal and non- 
reciprocal channels is also analyzed. 

In this paper, we consider both pairwise and non-pairwise 
switching, in which a multi-antenna relay works in precode- 
and-forward manner. We first study switching traffic among 
the stations using a zero-forcing MIMO relay, where each 
destination receives the desired signal with enhanced noise. 
We then study a more general network-coded relaying, which 
exploits physical-layer network coding for performance im- 
provement [8|-[11|. For fairness, we study how to design the 
precoder to maximize the minimum received SNR among all 
stations, which is referred to as the maxmin problem. Since the 
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Fig. 1. Wireless MIMO switching. 



maxmin problem is NP-hard, we use a semidefinite relaxation 
technique to compute an approximate solution. The problem 
further simplifies if the SNRs at all destinations are required to 
be identical, and we call it the equal-SNR problem. We derive 
conditions under which the maxmin and equal-SNR problems 
are equivalent. By evaluating the throughput performances of 
the two problems, we show the gap between them is small 
especially in the high SNR regime. That is, our numerical 
results suggest that we can use the equal-SNR problem to 
approximate the (NP-hard) maxmin problem. Furthermore, we 
show that network-coded relaying can noticeably improve the 
throughput performance over zero-forcing relaying. 

The remainder of the paper is organized as follows: Section 
II introduces the scheme of wireless MIMO switching. In 
Section III (resp. Section IV), the maxmin (resp. equal-SNR) 
problem is investigated for both zero-forcing and network- 
coded relaying. Section V presents the simulation results. 
Section VI concludes this paper. 

II. System Description 

Consider K stations, numbered l,...,K, each with one 
antenna, as shown in Fig. Q] There is no direct link between 
any two stations and the stations communicate via a relay 
with TV antennas. The precode-and-forward scheme applies 
under the condition of K < N, where the relay has enough 
degrees of freedom to switch all data streams at the same time. 
We assume K — N throughout for simplicity. In the case of 
K < N, all the matrix inverses in the paper shall be replaced 
by Moore-Penrose pseudo inverses fTZl . Each transmission 
consists of one uplink symbol interval and one downlink 
symbol interval of equal duration. In particular, the two symbol 
intervals are two slots in a time-division system. The uplink 
symbol interval is for simultaneous uplink transmissions from 
the stations to the relay; the downlink symbol interval is 
for downlink transmissions from the relay to the stations. 



Each round of uplink and downlink transmission realizes a 
switching permutation, as shall be described shortly. 

Consider one transmission. Let x = [x\, ■ ■ ■ ,xn] t be the 
vector representing the signals transmitted by the stations. 
Let y — [yi, ■ ■ ■ ,un) T be the received signals at the relay, 
and u = [ui,--- ,un] t be the noise vector with indepen- 
dent identically distributed (i.i.d.) noise samples following 
circularly-symmetric complex Gaussian (CSCG) distribution, 
i.e., u n ~ W c (0,7 2 ). Then 



y = Hx 



u. 



(1) 



where H is the uplink channel gain matrix. The relay multi- 
plies y by a precoding matrix G before relaying the signals. 
In this paper, we assume that the uplink channel and downlink 
channel are reciprocal, so that the downlink channel is H T . 
Thus, the received signals at the stations in vector form are 



r =H 1 Gy + w 
=H T GHx + H T Gu + w, 



(2) 
(3) 



where w is the noise vector at the receiver, with the i.i.d. noise 
samples following CSCG distribution, i.e., w n ~ A/" c (0, a 2 ). 
In the following, we describe two precoding schemes. 

A. Zero-forcing Relaying 

We refer to an N x N matrix P that has one and only 
one nonzero element on each row and each column, which is 
equal to 1, as a permutation matrix. Evidently, Px is a column 
vector consisting of the same elements as x but permuted in 
a certain order depending on P. For example, if 

^0 1 
1 
J) 1 

then P[xi,X2,X3] T = x±, X2\ T ■ In the case where all di- 
agonal elements of P are zero it is also called a derangement. 

Suppose that the purpose of G is to realize a particular 
permutation represented by the permutation matrix P, and to 
amplify the signals coming from the stations. That is, 



H GH = AP, 



(4) 



where A = diag{ai, • • • , oat} is an "amplification" diagonal 
matrix. Each diagonal element is regarded as the gain of a 
link. Accordingly, the precoder can be calculated as 



G 



H T APH 1 . 



(5) 



Let the receivers compensate for the amplification to yield 
received signals expressed collectively as: 

r = A~ 1 r = Px + v, (6) 

where the post-processing noise is expressed as 



v = PH ~~ 1 u + A ~ 1 w. 



Let us define 



Q^I + ^PH'H^pT 



(7) 



(8) 
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Fig. 2. A traffic demand among three stations. 

The covariance of the post-processing noise v is written as 
E{vv H } =~ f 2 PH- 1 H- H P T + a 2 A- 1 A- H (9) 

(10) 



2 A-l A-H 



(13) 
(14) 
(15) 
(16) 

T 



=Q — I + a A~ A 

Suppose all uplink transmissions are independent and use 
unit average power, i.e., E{xf} = 1, i = ,N. The 

problem is to design the precoder G to minimize the variance 
of the post-processing noise subject to a power constraint for 
the relay: 

E{y H G H Gy}< P . (11) 

For notational convenience, let the entries of an N x N 
matrix S be given by 

8 ij ±Q Ji [(H*)- l H- T ] ij . (12) 
From (Q~|i, the relay's transmit power can be evaluated as 
E[x H H H G H GHx + u H G H Gu] 
=Tr[GHH H G H + j 2 GG H ] 
=Tr[H- T A(I + j 2 PH- 1 H- H P T )A H (H*) 
=Tr[AQA H (H*)- 1 H- T ] 
=a H Sa, 

where we have used (0 and ((8), and a = [a%, ■ ■ ■ , djvj" is 
the gain vector with the diagonal elements of A. The power 
constraint on the relay is thus expressed as 

a H Sa < p. (17) 

B. Network-coded Relaying 

The MIMO switch described in Section II. A makes use of 
zero-forcing relaying, by which data are switched based on a 
permutation matrix, whose diagonal elements are all zero. If 
the fcth diagonal element is nonzero, it means that the relay 
forwards the signal from station k back to itself. There is no 
need to force a diagonal element to zero because the self- 
interference is known and can be removed. This is the basic 
idea behind physical-layer network coding (PNC) ifTTI . which 
underlies many other works, e.g., lfl3l -[ 161. 

In general, allowing PNC improves the performance. Even 
though the self-interference costs the relay some energy, 
removing the constraint on the diagonal of the derangement 
enlarges the possible set of the optimization problem, and 
thereby yields a better optimal objective. This can also be 
seen from an example of multi-way relaying, in which three 
single-antenna stations communicate with the help of a three- 
antenna relay. The traffic switching pattern among the three 
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Fig. 3. Assume the channel outputs of three users' signal vectors: the back 
vector b is the desired signal; the red vector a is the self-interference; the 
blue vector c is the other interference signal. 



stations are defined in Fig. [2] The desired signal of station 1 
is the black signal b; the red signal a is its self-interference; 
the blue signal c is the other interference signal. We assume 
the three signals and the channel gains are real-valued for 
simpler illustration. A sketch of the three signals after passing 
through the channel is shown in Fig. [3] i.e., d, b and c, which 
are all three-dimensional vectors due to three antennas at the 
relay. As shown in Fig. [3] we assume that c is along s-axis; 
a is in a;y-plane; and b = b t + b 3 + b K . In the case of zero- 
forcing, the desired signal of station 1, b, should be projected 
to z-axis, which is perpendicular to rry-plane spanned by the 
two signals d and c. In this way both the two interference 
signals are zeroed out and we get a post-processing signal 
b = b K . However, with network coding, we do not have to 
zero out the self-interference. By dropping this constraint, we 
only need to project the desired signal to the yz-plane which 
is perpendicular to the interference signal c, then we obtain 
the projected signal b"= b 3 + b K . Obviously, the projection 
b" with network coding is stronger than b' by zero-forcing. 
With PNC, we rewrite © as 



H T GH = A{P + B), 



(18) 



where B = diag{6i,-- - ,6jv} is a diagonal matrix to be 
determined. 

As an example of a symmetric derangement, the correspond- 
ing network-coded switch matrix has the pattern: 



Pi+B 



bi 1 

b 2 1 

1 b 3 

1 b 4 



(19) 



This switching pattern corresponds to two pairwise data ex- 
changes, in which stations 1 and 4 are one pair and stations 



2 and 3 are the other pair. Hence, the network-coded MIMO 
switching can construct multiple parallel two-way relay trans- 
missions. An example of asymmetric derangement may have 
the following switch matrix: 



B 



61 1 

1 b 2 

b 3 1 

1 64 



(20) 



This generalizes the traditional physical-layer network coding 
setting as presented in [11] because the data exchange is not 
pairwise. For both symmetric and asymmetric switch matrices, 
we shall refer to the corresponding matrices with nonzero 
diagonal as network-coded switch matrix, and the associated 
setup as MIMO switching with network-coded relaying. 

Once the receivers compensate for the amplification and 
remove self-interference, the resulting signals form this vector: 



r = Px + v', 
where the post-processing noise v' is expressed as 



(P + B)H~ 1 u + A _1 u>. 



(21) 



(22) 



The covariance of v' is written as 

E{v'v' H } = R I + a 2 A- l A~ H , (23) 

where 

R = I + -y 2 (P + B)H- 1 H- H (P + B) H . (24) 
The constraint of the relay power consumption is rewritten as 
Q(A,B) 

= Ti[H- T A(R + BP T + PB H + BB H )A H (H*)- 1 }(25) 

= Tr[A(R + BP T + PB H + BB H )A H (H*y 1 H- T ] (26) 

< P- (27) 

We have thus established another framework for MIMO 
switching by network-coded relaying method, in which P + B 
is a switch matrix. This framework can be generalized to the 
case where the switch matrix realizes a general transmission 
pattern. For example, if there are two nonzero non-diagonal 
elements in a column of the switch matrix, then a multicast 
connection is being realized within one switch matrix. In 
fact, by scheduling a set of switch matrices, each realizing 
a permutation, we can satisfy arbitrary user traffic patterns. 

III. The Maxmin Problem 

In this section, we formulate a maxmin problem, in which 
the minimum received SNR among all the stations is maxi- 
mized. According to (0, the post-processing noise power of 
receiver i is 

2 

e l = q l -l + -^, (28) 
I Oil 2 

where = Q u . Thus, the received SNR is 1/ej. Since 
the system is half-duplex with uplink and downlink of equal 
duration, the throughput achieved by Gaussian signaling is 



Cj = 2 l0 S2 



1 

1 + - 



(29) 



in bits per symbol period. 

We first study the maxmin problem without PNC in Section 
III. A, and then allow PNC in Section III.B. 

A. Zero-forcing Relaying 

Let e denote the maximum post-processing noise power 
among the stations. An optimization problem is formulated 
as follows: 



mm 
a 



s.t. 



> 



e + 1 - ft 
a H Sa < p, 

e > 0. 



, i = l,... , N, 



(30a) 

(30b) 

(30c) 
(30d) 



Lemma 1: Every optimal solution for the optimization 
problem (f30b must satisfy the relay power constraint d30cb 
with equality. 

Proof: Let a = [oi, - ■ ■ ,cln} t denote the optimal solu- 
tion for the problem (f30b with the optimal objective e. Suppose 
a satisfies the constraint (I30cb with strict inequality. Then there 
exists t > such that 



a H Sa < p, 



(31) 



for all a with |aj| 6 (|a<| — r, + r), i = 1, • • • ,N. Let 

" 2' 



a, 



5, i = 1,- ■ ■ , N, and let 
a 2 



ad 



Then 



Ei(o£) < Si(a;), * = 1, - ■ • , JV. 



(32) 



(33) 



Since each Hj(a^) is smaller than Hj(<Zj) for all i, the 
maxmin objective becomes smaller with the solution a = 
[a[,--- ,a' N ] T . Thus, a is not the optimal solution, and 
contradiction arises. ■ 

With this lemma, we could eliminate the feasible solutions 
with which the relay consumes less power than p. 

Proposition 1: The maxmin proble is equivalent 

to the following quadratically constrained quadratic program 
(QCQP), 



mm 
a 



s.t. 



a H Sa 



\aA 2 > 



1, 



,N. 



(34a) 
(34b) 



e+l-?i 

Proof: Let a be an optimal solution of d30b . and the 
associated optimal objective is e > 0. Note that e is the 
maximum noise power for all i. 

The solution a satisfies (I34bb . where the largest noise power 
among all i is equal to e. Hence, a is a feasible solution 
of d34i l. According to Lemma Q] the power consumption of 
the relay is a H Sa = p. Let a be an optimal solution of 

'Essentially, it is a minmax problem with respect to (w.r.t.) the post- 
processing noise power. In this paper, we call it the maxmin problem w.r.t. 
the received SNR to keep it consistent with the following equal-SNR problem 
proposed in Section IV. 



Q41 l, The power consumption of the relay can not be larger 
than p, otherwise a is even worse than a for ( f34b . If the 
power consumption of the relay a H Sa is strictly smaller than 
p, then a is a feasible solution of ( f3Qb . which is at least as 
good as a since the maximum noise power is not larger than 
e. That is, a is an optimal solution of d30T >. However, the 
power consumption a H Sa < p contradicts Lemma Q] Thus, 
a H Sa = p. Furthermore, there is at least one constraint in 
( f34t in which equality holds for a, otherwise a is a better 
solution for (130b . We have proved the optimal solution of (130b 
is also the optimal solution of (f34l >. vice versa. Therefore, the 
two problems are equivalent. ■ 
Proposition 2: The maxmin problem (f30b is NP-hard in the 
size of N when 7 7^ 0. 

Proof: Problem ( [34-b is equivalent to problem (2) in 
ifTTl , which has been proved to be NP-hard in general. Thus, 
problem ( [30b is NP-hard as well. We do not repeat the steps 

in El. ■ 

We can use the semidefinite relaxation (SDR) technique 
in IfTTl to find a throughput upper bound and a suboptimal 
solution for d34i >. Let X = aa H , then d34l i can be rewritten 

as 



min Ti\SX] 



s.t. Tr[^ X] - ii 



e + 1 - q% ' 
l t >0, i=l,--- ,N, 

XhO, 

rank(X) = 1, 



(35a) 

(35b) 

(35c) 
(35d) 
(35e) 



where denotes element-by-element multiplication, i.e., the 
Hadamard product; X y means the matrix X is symmetric 
positive semidefinite; Ei is an N x N matrix, in which element 
is 1 and all the other elements are 0; li, i = 1, • • • , N 
are "slack" variables. If we drop the rank-1 constraint (I35et . 
problem d35l > is in the standard form of a semidefinite pro- 
gramming problem (SDP). 

Overview of Our Approach to Maxmin Optimization Prob- 
lem: We now overview the approach of our numerical in- 
vestigation of the maxmin optimization problem. Although 
we focus on zero-forcing relaying here, we use the same 
approach for the study of maxmin optimization for network- 
coded relaying as well after its corresponding formulation is 
set up in Part B. 

We recognize that it is difficult to solve the maxmin problem 
(l30b . Therefore, we attempt to find its suboptimal solution. The 
logical steps and the rationale for our approach to finding a 
suboptimal solution are summarized below: 

1) We have already proved that the maxmin problem is 
equivalent to the QCQP (l34l l. Unfortunately, solving for 
the optimal solution of the QCQP is still difficult. 

2) Fortunately, the SDR technique proposed in 1171 can be 
used to find a good suboptimal solution of the QCQP. 
The technique consists of the following three steps: 

a) Rewrite the QCQP as an optimization problem I 
with a constraint of rank 1. 



b) By dropping the constraint of rank 1, the problem 
d35l l is turned into a SDP. The optimal solution of 
the SDP can be found by some toolboxes, such 
as SeDuMi and SDPT3. Importantly, the optimal 
objective of the SDP is an upper bound of that 
of the QCQP, since the QCQP has one extra 
constraint, i.e., the constraint of rank 1. 

c) We then use the randomization technique in lfl7l 
to approximate the optimal solution of the QCQP 
based on the optimal solution of the SDP. With 
the randomization technique, we get a result, which 
satisfies the constraints of the QCQP, and this result 
is at least a suboptimal solution of the QCQP. 

In lfl7l . the authors showed that the suboptimal solution 
obtained as above can achieve an objective close to the 
global optimum. As will be shown, our simulation results also 
validate the near-optimality statement in IfTTl . 

Numerical Method (One-dimensional Search): As per the 

discussion in the above overview, we use the suboptimal 
solution of (|35T > to approximate the solution of (f30b . The 
optimal solution of (130b is the value of e in (fJBT l. for which the 
objective of d35l . i.e., the minimum power consumption is p. 
We solve for e by one-dimension search from max^j — 1, i.e., 

when the power consumption of the relay (I35ab approaches 
infinity. In each step of the search, e is increased by a small 
amount 5. Given an e, we can use SDP solvers to find the 
minimum power consumption of the relay. When the step size 
5 is small enough, the first value of e, for which the minimum 
power consumption is p, is the solution ( l30b . 



B. Network-coded Relaying 

The network-coded problem is formulated as 



mm 

A.B 



S.t. 



Rii — 1 



< e, i = 1, 



,N, 



q(a,b)< p , 

e>0, 



(36a) 

(36b) 

(36c) 
(36d) 



where R and f2(A, B) are defined in (f24l i and d26*i >. respec- 
tively. In contrast to the previous problem of zero-forcing 
relaying, the constraint of the relay power consumption d36cl > 
is quartic, making it more difficult than ( f3Qb - We propose an 
iterative algorithm, in which A and B defined in < fT~8T > are 
optimized iteratively. 

1 ) Optimize A for given B: Given B, problem ( |36T > can 
be formulated as <f30b by redefining 



q% = R 



[R + BP T + PB H + BB H \ yi [{H T H*)- r 



a, (37) 

(38) 



Thus, the network-coded problem with fixed B can be solved 
by the SDR technique, which is the same as solving (l34i l. 



2) Optimize B for given A: For ease of notation, let 

M =A H (H*)~ 1 H~ T A, 
N =1 + j 2 H- 1 H~ H . 

The relay power consumption d26l l is rewritten as 

Q(A, B) =Tr[(P + B)N(P + B) H M] 
=Tr[BNB H M + BNP T M 
+ MPNB H + PNP T M}. 

Constraint ( I36bb can be rewritten as 



I + 7 2 (P + B)H ' H "(P + B 



< e- 



Oi 



(39) 
(40) 

(41) 
(42) 

(43) 



Both the objective and the constraint w.r.t. B are inho- 
mogeneous quadratic. To homogenize this problem, we let 
b = [b T ,t] T , where b = diag{Z?}. Then the relay power 
consumption d42l can be written as 



[b H t*] 



~s t f 




b 


f H 




t 



1v[PNP T M] 



(44) 



= b Sb + Tr[PNP T M], 



where S t = N T M, / = diag{MPAT}, t = 1. Let X = 
bb , then this problem can be homogenized as 



min Tr[5Xl 

s.t. Tr[Ei X] 
1 

L > 0, i 



P{H H H) X P 



H 



1. 



,N, 



Tr[E N+1 X] = 1, 

XhO, 
rank(X) = 1, 



(45a) 
(45b) 



(45c) 
(45d) 
(45e) 
(45f) 



where E; L . % = 1, • • • , N, can be easily designed according to 
(1431 . The last element of E^+i is 1 and all the others are 0. 
Thus, constraint ( 145 di is actually |i| 2 = 1. The problem can 
be solved in the same way as solving d35l l. Note that constraint 
( 145 dl ) only requires the magnitude of t to be 1. Assume that 
the solution of ( |45T > is b op t = [b , t\ T , then b op te^ a , for all a 
also guarantees the optimality of (05]), since adding a phase 
rotation does not change X. Thus, the solution of this problem 
is be-' J ^. 

To sum up the iterative algorithm, in order to design A and 
B, we can first initialize A or B by any diagonal matrix. 
Then we iteratively perform the two solvers to optimize A 
and B until convergence. 

IV. The Equal-SNR Problem 

In this section, we require the received SNR at all stations to 
be identical and find its maximum of the equal-SNR problem. 
Such a formulation was first investigated in J7J. The equal- 
SNR requirement not only guarantees perfect fairness, but also 
allows more efficient computation. 



A. Zero-forcing Relaying 

The optimization problem is formulated as 



s.t. 



e+l-qi 
a H Sa < p, 
e > 0. 



,N, 



(46a) 

(46b) 

(46c) 
(46d) 



The only difference between the equal-SNR problem and the 
maxmin problem (f30b is that the inequality (I30bb is replaced 
by the equality J46bt . In lfl"8l . we only provided suboptimal 
solutions. In this paper, we will investigate optimal solutions 
and propose analytical and numerical suboptimal solutions. As 
has been proved in [18|, the equal-SNR problem is feasible. 
Another property is that the solution of the equal-SNR problem 
is feasible for the maxmin problem, since constraint (I30bb of 
the maxmin problem has a larger possible set than constraint 
(I46bb in the equal-SNR problem. Thus, the optimal objective 
of (l46T l can not be smaller than that of (f30b - 

Proposition 3: If no additional noise is introduced at the 
switch, then the optimal solution of ( f30b is such that each 
station has exactly the same post-processing noise power. That 
is, in this case, the two optimization problems (l30l l and (l46l ) 
are equivalent. 

Proof: As proved in Proposition [2] the optimization 
problem d34l l is equivalent to d30l . When 7 = in d34l . 
qi = 1 for all i = 1, • • • , N. The objective can be rewritten 
as 2~2iLi s a\ a i\ 2 - The optimal solution of a is obvious, and it 
satisfies \ai\ 2 = j—^. That is, for the optimal solution of OOl . 
each station has equal post-processing e. ■ 

Thus, we can use the solution of the equal-SNR problem 
to approximate that of the maxmin problem in the high SNR 
regime. The gap will be evaluated numerically in Section V. 

Optimal Solution in the Case of Two Stations 

In order to minimize the post-processing noise power e, 
we should try to maximize |ai| and 1 0-2 1 - Given any e, the 
amplitudes |ai| and |a.2 1 can be calculated by the equal noise 
power constraint. Then we should find their optimal phases to 
minimize the relay power consumption. 

If N — 2, the power constraint of the relay in ( I46cb can be 
expanded as 

a H Sa = sn|ai| 2 + S22|a2| 2 + S2iaia2 + s i2ala 2 , (47) 

where sn > 0, S22 > 0, and S12 = s|i- By the definition of 
S in H2i . S12 = S21 > 0. Formula d4Tb can be written as 



a H Sa = 



'snai 



Since in (l48b only 



512 



/SllOi 



zn 2 



S22 



°12 
Sll 



02 



(48) 



= « 2 



is related to the phases 



of variables a\ and 0,2, the global minimum is achieved by 
real-valued ai and 02 with opposite signs. Without loss of 
optimality, assume a\ > and 02 < 0. Then the power 
consumption constraint can be simplified as 



Sa 



Slid 2 



2si 2 aia 2 



S22A2- 



(49) 



According to the equal noise power constraint (I46bb . we have 

a 



a 2 = 



(50) 



9i - <?2 



Plugging (15 Ot into the power constraint, we have a biquartic 
equation ( BP . 



=s 2 n qjal + 2q s [s 2 1 a 2 + SnS 2 2(J 2 - Snpqs 

- 2 S 2 12 a 2 }at + [(i 4 (sii + s 22 f +p 2 q 2 s 

- 2pa 2 qs(2sn + S22) - ^sl 2 a 4 ]a 4 

+ [2p 2 a 2 qs - 2pa\ Sll + s 22 )]a 2 + pV, 



(51) 



where q$ = q\ — q 2 . Since the equal-SNR problem is feasible, 
there exists solutions for (f5TJ. In order to maximize a\ and 
a 2 , the largest real root of ( fBTl ) is the optimal solution of a\, 
which admits an analytical solution [19|. Consequently, a 2 can 
be calculated by d50b . 

Alternatively, after we deduce that a x and a 2 have opposite 
signs, the one-dimensional search in Section III. A can be used 
to solve the problem as well. Thus, we have solved the equal- 
SNR problem of 65} in the case of N = 2. 

It is not difficult to see that making the two user signals 
have opposite signs (phases) minimizes the relay power con- 
sumption. The optimal solution of the maxmin problem when 
N = 2 also has the opposite-phase property. As we shall 
see, in the case of more than two users with users forming 
pairs, assigning each pair of users opposite phases is an 
effective scheme. Thus, in general the opposite-phase setting 
is effective for pairwise transmission in any problem which 
needs to minimize the relay power consumption, including 
the preceding maxmin problem. 

Suboptimal Solution in the Case of N > 2 

If there are more than two stations, we propose a suboptimal 
solution to J46b . The vector a consists of arbitrary complex 



numbers. Given e, one can obtain \aj\ from (146bt . Let the 
phases dj = ^ctj, j = 1, • • ■ ,N, be fixed. We solve (|46) 
to obtain the minimum noise variance e(9i, ■ ■ ■ ,6n), which 
satisfies a H Sa = p. 

We now consider the general case of complex-valued a. 
With the numerical method of one-dimensional search, the 
optimization of aj 's amplitudes and phases can be decoupled. 
There exists an e such that a H Sa = p. Denote such an e by 
e(#i, • • • , 9n) since in each step we regard the amplitudes as 
constant values. It then suffices to solve for 



arg min e(6>i, • • • , 9 if). 

01,— $N 



(52) 



In the following we provide two suboptimal algorithms for 
66) via (|52). 

Non-PNC Random-phase Algorithm: In d52) . we note that e 
is a complicated nonlinear function of 9j, A time-consuming 
exhaustive search can be used to find the solution to ((52). To 
reduce the computation time, we find the best set of phases 
over a randomly generated candidates in lieu of an exhaustive 
search. We call this the random-phase algorithm. We divide 
the interval of [0, 2w) equally into M bins with the values 
of 0, , ^jg-^ respectively, and we randomly pick 



among them to set the the value of 9j for each and every 
j = 1, ••• ,N. After that, we compute the corresponding 
6(6*1, •• • ,&n) by solving 66b) and J46c| ). Given an e, sub- 
stituting it into 66b) yields |o 3 -| for all j. We perform L trials 
of these random phase assignments to obtain L phase vectors 
of (6*i, •• • , On). Calculating the relay power consumption by 
( I46cb . we choose the phase vector using the least power among 
the L candidates as our approximated phase solution. By one- 
dimensional search, the estimated e* can be achieved when the 
least power consumption of the relay is p. Accordingly, we can 
calculate |aj| for all j with the estimated e* , then a with the 
approximated phase solution. Hence, G can be calculated by 
its definition (|5). This best-out-of-L-trials feasible solution is 
in general larger than the actual optimal e*. In Section V, we 
will show that large gains can be achieved with only small M 
and L. Moreover, increasing M and L further yields very little 
improvement, suggesting that the estimated e* with small M 
and L is close to the result achieved by an extensive search. 

Non-PNC Opposite-phase Algorithm: Recall that in the case 
of N = 2 the solution has a\ and a 2 with opposite signs, 
or in general the minimum power consumption is achieved 
with ai and a 2 being complex numbers with opposite phases, 
i.e., 9i = 9 2 + 7r. For large even number of N, consider the 
situation in which the transmissions are pairwise. The stations 
form pairs, and two stations in a pair exchange data with each 
other only. Assume that station pair I consists of stations ir{l) 
and k(£). Define 

[{H H)' 1 ]^)^) [{H Hy 1 ]^)^)] 

where hgx > 0, > 0, hi 2 6 C. The post-processing noise 
power can be factorized in terms of ba and bti as follows: 



ha hi2 




h\ 2 his 





e =7 2 /i£ 3 + 
=l 2 hn + 



an\ 



\ai2\ 



(54) 



(55) 



The relay power consumption can be written as 

N/2 

a ei +7 2 \h n \ l an 



(56) 



+ {hii + 7 hnha - 1) I an | 

+ {his + l 2 hnhis - 7 4 |^2| 4 ) K2I 2 } , 

where the summation consists of the inner-pair quadratic items 
of a, and 0\{a) denotes the sum of the quadratic items of 
a across pairs, e.g., aiaj where i, j from different pairs. For 
fixed amplitudes of an and a^, it is obvious that the phase 
vector (6*i, • • ■ ,9 n) that minimizes ( 156) is the optimal phase 
vector of (06). We could let two stations of a pair to have 
opposite phases to lower the inner-pair part of ( 156) . Simulation 
results indicate that as long as the relative phase is tt within a 
pair, the throughput performance remains essentially the same 
regardless of the phase differences between different pairs. 
Thus, to simplify the problem we use real numbers for the 
elements of a. The amplitudes are calculated in the same way 
as that of the non-PNC random-phase algorithm. 



Remark 1: The opposite-phase solution approaches the op- 
timal solution of the equal-SNR problem as SNR increases, 
since r y 2 Oi(a) becomes negligible. Thus, the opposite-phase 
solution approaches the optimal solution of the maxmin prob- 
lem as well according to Proposition 3. 



B. Network-Coded Relaying 

The optimization problem is formulated as 



mm e 

A,B 



s.t. Ru - 1 



e, i = l,-->,N, 



n(A,B)< P , 

e > 0. 



(57a) 

(57b) 

(57c) 
(57d) 



The only difference between the equal-SNR problem and the 
maxmin problem (l36l l is that the inequality ( I36bb is replaced 
by the equality ( I57bb - A proposition for the network-coded 
problem can be proved in the same way as Proposition 3. 
We then use the solution of the equal-SNR problem ( |57T > to 
approximate that of the maxmin problem d36l l at high SNR. 
The gap will be also evaluated in Section V. 

PNC Identical-b Random-phase Algorithm simplifies the 
matter by introducing an extra constraint: B = bl, where b is 
a real scalar. We set a range for searching b. For each trial of b, 
we use essentially the same method as the non-PNC random- 
phase algorithm proposed in Section IV.A to find the phases 
9j of dj and <pj of bj, and find the corresponding estimated 
e* as well. We then find b that yields the least post-processing 
noise power. The beamformer G can be calculated from P, 
\dj\, b, 9j and ipj. 

PNC Phase-aligned Algorithm: Consider pairwise transmis- 
sion, i.e., the case of N being an even number. Define 









l>(:2 



(58) 



where bn and b^ are the diagonal elements ir(£) and k(£) 
of Be, respectively. The post-processing noise power can be 
factorized in terms of bn and bti as follows: 
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+ l 2 h e3 + 
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(59) 
(60) 



This formula shows the potential advantage of introducing bn 
and bi2, i-e-, the application of physical-layer network coding. 
If we set bn and bn as 



hi = —^, b n = —^, (61) 

hn "a 

the post-processing noise power can be minimized in terms of 



bn and bg 2 - The relay power consumption can be written as 

N/2 

7 2 2 (A) + ^{|an+A £ a £2 | 2 
I 2 \hi2\ 2 + J 2 h ei hi3 - 
l 2 \hi 2 \ 2 + l 2 h n hi3 ~ X 2 I \ai2 
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U2\ 



(62) 
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Note that 02(A) denotes the sum of the quadratic items of A 
across pairs. Since H~ l H~ H is positive semidefinite, we can 
prove \hi2\ 2 < hnh^, then < 0. Thus, for fixed amplitudes 
of an and ai2, the inner-pair part of (lo*2l can be lowered 
when the phases of an and a^ are aligned, e.g., an, 0-12 > 
0. Simulation results indicate that as long as the phases of 
A's diagonal elements are aligned within pairs, the throughput 
performance is not sensitive to the phase differences among 
different pairs. 

Remark 2: Note that the phase-aligned solution is the opti- 
mal solution of the maxmin problem (|36l l when the noise level 
at the relay is zero. The result is obvious after setting 7 to 
in (US. 

In summary, Remarks 1 and 2 show that when the noise 
at the relay is small, the opposite-phase solution approaches 
optimal for non-PNC case and the phase-aligned solution 
approaches optimal for the PNC case. Our simulation results 
validate this conclusion. 

V. Numerical Results 

In this section, we evaluate the throughputs of various 
designs of MIMO switches. We assume the maximum transmit 
power of the relay and every station are the same (thus p — 1), 
and the noise level at the relay and the stations is the same. Our 
simulation indicates that the system throughputs are roughly 
the same with different symmetric permutations. The same 
result can be concluded for asymmetric permutations. Thus, 
we use one permutation for each of them (Pi and P2 are 
given by the matrices described in ( fl9l ) and d20l > with the 
diagonal elements set to 0) in simulations. 

Observation 1: For equal-SNR zero-forcing (i.e., non-PNC) 
relaying, the optimal setting for the case of two stations (N = 
2) has the property that the two elements of a have opposite 
signs. In general, the non-PNC opposite-phase algorithm is an 
effective scheme for pairwise transmission with larger even 
number of stations. 

A similar framework as ours is investigated in [20 1, which 
focuses on optimizing the sum rate of all stations. Therein, 
a suboptimal beamforming scheme is proposed, which also 
uses zero-forcing detection and zero-forcing precoding, and 
simply uses a positive scalar weight to control the relay 
power consumption instead of our diagonal A. We regard this 
scheme as a benchmark and call it "the basic scheme." All 
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Fig. 4. Throughput comparison of different relaying schemes in the case of 
two stations. 



Fig. 5. Throughput comparison of different relaying schemes for pairwise 
switching pattern in the case of four stations. 



schemes proposed in this paper have an advantage over the 
basic scheme in that they guarantees fairness. 

Compared with the basic scheme in Fig. |4j the optimal 
setting of TV — 2, i.e., the non-PNC opposite-phase algorithm 
proposed in Section IV.A achieves more than 0.6 dB gain in 
the low SNR regime. The gain becomes smaller as the SNR 
increases, e.g., around 0.25 dB gain at the SNR of 15 dB. 
We explain why the gain diminishes for high SNR as follows. 
When the relay noise power is zero, |ai| = |or,2 j. In this case, 
S12 becomes in d48l >. Then the throughput performance does 
not depend on the phase difference of a\ and ei2- The opposite- 
sign setting is equivalent to the identical-gain setting, i.e., the 
basic scheme. Thus, the gain over the basic scheme becomes 
trivial in the high-SNR regime. 

Fig. [5] presents the throughput in the case where N = 4 
stations form two pairs for pairwise transmission. The through- 
puts are roughly the same when we vary the phase differences 
of the pairs while keeping the phase difference within each 
pair to 7r. (Experimentation with the phases is not shown in 
Fig. [5] to avoid cluttering.) With this result, we could set the 
elements of a such that one element in each pair is positive 
and the other element is negative. The results are similar to that 
of N = 2. The non-PNC opposite-phase algorithm achieves 
0.8 dB gain in the low SNR regime and 0.25 dB gain over 
the basic scheme in the high SNR regime. 

Observation 2: For equal-SNR relaying, physical-layer net- 
work coding can be applied as a relaying method to improve 
throughput performance significantly. For pairwise transmis- 
sion, the PNC phase-aligned algorithm achieves significant 
gains over any other non-PNC scheme of zero-forcing relay- 
ing. 

We present simulation results on the PNC phase-aligned 
scheme in Fig. |4] and Fig. [5] When we apply physical-layer 
network coding in our MIMO switching, significant gains can 
be achieved over other schemes. The proposed PNC phase- 
aligned scheme outperforms all the other non-PNC schemes. 



Note in particular that compared with the basic scheme, it 
does not involve complicated calculations during the one- 
dimensional search of A and the setting of B. However, the 
PNC phase-aligned scheme can not be applied to non-pairwise 
transmissions. 

Observation 3: For equal-SNR zero-forcing relaying with 
the non-PNC random-phase algorithm, the simulation results 
indicate that large gains can be achieved with a small number 
of phase bins and trials. For network-coded relaying, the 
PNC identical-rj random-phase algorithm can be applied for 
both pairwise and non-pairwise transmissions to achieve the 
network coding gain. It is worth mentioning that network 
coding helps not only for the traditional pairwise switching 
pattern but also for the non-pairwise pattern. 

In Figs, g] and |5] when M — 8 and L = 10, the non-PNC 
random-phase scheme proposed in Section IV.A can achieve 
good enough throughput performance. With the PNC random- 
phase algorithm proposed in Section IV.B, the throughput 
performance is even better than that of the best non-PNC 
scheme. For M = 8, it achieves around 1.2 dB gain with 
L = 10 and 1.4 dB with L = 100 compared to the basic 
scheme in the case of two stations; it can achieve around 1.5 
dB and 2 dB gains with 10 and 100 trials, respectively, over 
the basic scheme in the case of four stations. However, it needs 
larger M, L and more random trials to perform as good as 
the PNC phase-aligned scheme. 

Consider the non-pairwise transmission in Fig. [6] Note that 
the non-PNC opposite-phase scheme and the PNC phase- 
aligned scheme can not be applied to non-pairwise trans- 
missions. The non-PNC random-phase scheme and the PNC 
random-phase scheme can still outperform the basic scheme. 
For M = 8, when L = 10 the non-PNC random-phase scheme 
could achieve around 0.15 dB gain. The PNC random-phase 
scheme could achieve around 1.1 dB and 1.3 dB gains over 
the basic scheme with 10 and 100 trials, respectively. 

Observation 4: In general, for zero-forcing (i.e., non-PNC) 
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Fig. 6. Throughput comparison of different relaying schemes for non- 
pairwise switching pattern in the case of four stations. 



relaying, the throughput of the maxmin problem is larger than 
that of the equal-SNR problem. However, the throughput gap 
between the equal-SNR problem and the maxmin problem is 
small over a wide range of SNR. 

We first evaluate the throughput performances with two 
stations. An interesting question is how large is the throughput 
gap between ( f30b and d46i i corresponding to their optimal noise 
powers. In Fig. |U the throughput gap between the two curves 
is very small. Note that the optimal solutions of the maxmin 
problem are found by exhaustive search. In Table I, the gap 
is evaluated over a wide range of SNR. The results indicate 
that the gap is less than 0.1% for the SNR regime from to 
30 dB. 

In the case of four stations, the exhaustive search for solving 
the maxmin problem becomes computationally expensive. 
Therefore, we use the upper bound of the maxmin problem 
calculated by the SDR scheme for benchmarking instead. 
Generally, the throughput gap between the equal-SNR and the 
upper bound is small, and becomes even smaller in the high 
SNR regime in Fig. [5] and Table II. Since we use the upper 
bound for the maxmin solution for benchmarking, we conclude 
that the throughput gap between the maxmin problem and the 
equal-SNR problem is also small. 

Therefore, the maxmin problem can be well approximated 
by the equal-SNR problem. We note that the trend as indicated 
by the simulations results in Fig. |H Fig. [5] Table I and Table 
II is consistent with the analytical result of Proposition [3] In 
the high SNR regime, the noise becomes negligible, and the 
gap between equal-SNR and maxmin diminishes. 

Observation 5: The throughput of the equal-SNR problem 
is roughly the same as the approximate throughput of the 
maxmin problem achieved by the SDR technique for pairwise 
switching pattern. However, for the non-pairwise pattern, the 
SDR technique achieves good throughput performance for 
both non-PNC and PNC relaying schemes. 

With reference to Table I and Table II for pairwise switching 



pattern, for zero-forcing relaying our non-PNC opposite-phase 
algorithm is better than the SDR scheme in the high SNR 
regime, and for network-coded relaying the SDR scheme is 
better than our PNC phase-aligned algorithm. However, the 
gap is mostly smaller than 2% for SNR larger than dB. 

For the non-pairwise switching pattern, the random-phase 
scheme is close to the SDR upper bound for the non-PNC 
relaying. However, for the PNC relaying, the iterative SDR 
scheme outperforms the PNC identical-6 random-phase al- 
gorithm since the latter needs more trials to achieve better 
throughput performance. 

The overall implications of our analytical and simulation 
results are as follows. The equal-SNR scheme, with the target 
of achieving perfect fairness among the links, is also a good 
approximation to the maxmin problem when the relay noise is 
small. Given a symmetric switch matrix that realizes pairwise 
transmissions, we could use the non-PNC opposite-phase 
algorithm or the non-PNC random-phase algorithm for zero- 
forcing relaying, and the PNC phase-aligned algorithm or the 
PNC identical-6 random-phase algorithm for network-coded 
relaying, to identify a suitable gain vector. The PNC phase- 
aligned algorithm has good throughput performance as well as 
fast execution time. Given an asymmetric switch matrix that 
realizes non-pairwise transmissions, we could use the random- 
phase algorithm for zero-forcing relaying and network-coded 
relaying to identify a suitable gain vector. The SDR scheme 
also achieves good throughput performance; however, its com- 
plexity is generally higher than the equal-SNR schemes. 



VI. Conclusion 

We have proposed a framework for wireless MIMO switch- 
ing to facilitate communication among multiple wireless 
stations. With optimized precoders, network-coded relaying 
improves the throughput performance significantly over non- 
network-coded relaying. 

The maxmin solution and the equal-SNR solution have their 
respective advantages. The former yields better throughput 
performance. However, the equal-SNR solution guarantees 
perfect fairness. With the equal-SNR solution, MIMO switch- 
ing can be easily extended to multiple transmissions, by 
which general transmission patterns can be realized, including 
unicast, multicast, broadcast, or a mixture of them lfT8l . More- 
over, the maxmin problem is NP-hard, which is solved using 
exhaustive search. Even for the approximation of the SDR 
scheme, SDP problems using interior point methods has the 
complexity cost at most 0{N 7 ) [17|. However, the proposed 
schemes for the equal-SNR problem only has the complexity 
cost of 0(N 3 ), which is mainly induced by calculating ma- 
trix inverses. Hence, the equal-SNR problem is practical for 
implementation. For the above reasons, the equal-SNR setting 
is perhaps more amenable to practical deployment. 

In future work, it will be interesting to explore switch 
matrices that realize more complicated patterns than unicast. It 
would also be interesting to study the case where the number 
of antennas at the relay is fewer than the number of stations. 



TABLE I 

Worst-station Throughputs of the maxmin problem (exhaustive search & SDR) and throughputs of the equal-SNR problem 

(NON-PNC OPPOSITE-PHASE & PNC PHASE- ALIGNED ALGORITHMS) WHEN N = 2; A DENOTES THE VARIATIONAL RATIO W.R.T. THE EQUAL-SNR 
SOLUTION OF EITHER NON-PNC OPPOSITE-PHASE OR PNC PHASE-ALIGNED ALGORITHMS, RESPECTIVELY. 



Non-PNC PNC 



SNR 
(dB) 


Equal-SNR 


Maxmin 


Equal-SNR 


Maxmin 


Opposite-phase 


Optimal/A(%) 


SDR/A(%) 


Phase-aligned 


SDR/A(%) 





0.1270 


0.1271 / 0.08 


0.1270 / -0.00 


0.2305 


0.2363 / 2.52 


10 


0.7990 


0.7994 / 0.05 


0.7987 / -0.03 


1.2091 


1.2264 / 1.43 


20 


2.1249 


2.1256 / 0.03 


2.1202 / -0.22 


2.7250 


2.7549 / 1.10 


30 


3.7075 


3.7083 / 0.02 


3.6864 / -0.57 


4.3762 


4.4016 / 0.58 



TABLE II 

Worst-station Throughputs of the maxmin problem (SDR upper bound & SDR) and throughputs of the equal-SNR problem 

(NON-PNC OPPOSITE-PHASE & PNC PHASE- ALIGNED ALGORITHMS) WHEN N = 4; A DENOTES THE VARIATIONAL RATIO W.R.T. THE EQUAL-SNR 
SOLUTION OF EITHER NON-PNC OPPOSITE-PHASE OR PNC PHASE-ALIGNED ALGORITHMS, RESPECTIVELY. 



SNR 
(dB) 




Non-PNC 




PNC 


Equal-SNR 


Maxmin 


Equal-SNR 


Maxmin 


Opposite-phase 


SDR-Upper/A(%) 


SDR/A (%) 


Phase-aligned 


SDR/A(%) 





0.0661 


0.0720 / 8.85 


0.0720 / 8.85 


0.1266 


0.1376 / 8.69 


10 


0.5504 


0.5562 / 1.05 


0.5552 / 0.87 


0.8842 


0.9018 / 1.99 


20 


1.7314 


1.7328 / 0.08 


1.7302 / -0.08 


2.3302 


2.3435 / 0.57 


30 


3.2906 


3.2918 / 0.04 


3.2818 / -0.27 


3.9611 


3.9734 / 0.31 
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